Discovering Generalized Episodes Using Minimal Occurrences

نویسندگان

  • Heikki Mannila
  • Hannu Toivonen
چکیده

Sequences of events are an important special form of data that arises in several contexts, including telecommunications, user interface studies, and epidemiology. We present a general and flexible framework of specifying classes of generalized episodes. These are recurrent combinations of events satisfying certain conditions. The framework can be instantiated to a wide variety of applications by selecting suitable primitive conditions. We present algorithms for discovering frequently occurring episodes and episode rules. The algorithms are based on the use of minimal occurrences of episodes; this makes it possible to evaluate confidences of a wide variety of rules using only a single analysis pass. We present empirical results on t,he behavior of t.he algorithms on events stemming from a WWW log.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Algorithms to Discover Complete Frequent Episodes in Sequences

Serial episode is a type of temporal frequent pattern in sequence data. In this paper we compare the performance of serial episode discovering algorithms. Many different algorithms have been proposed to discover different types of episodes for different applications. However, it is unclear which algorithm is more efficient for discovering different types of episodes. We compare Minepi and WinMi...

متن کامل

Inferring Neuronal Network Connectivity using Time-constrained Episodes

Discovering frequent episodes in event sequences is an interesting data mining task. In this paper, we argue that this framework is very effective for analyzing multi-neuronal spike train data. Analyzing spike train data is an important problem in neuroscience though there are no data mining approaches reported for this. Motivated by this application, we introduce different temporal constraints...

متن کامل

Mining Closed Episodes from Event Sequences Efficiently

Recent studies have proposed different methods for mining frequent episodes. In this work, we study the problem of mining closed episodes based on minimal occurrences. We study the properties of minimal occurrences and design effective pruning techniques to prune non-closed episodes. An efficient mining algorithm Clo_episode is proposed to mine all closed episodes following a breadth-first sear...

متن کامل

Discovering Associations between Climatic and Oceanic Parameters to MonitorDrought in Nebraska Using Data-Mining Techniques

Drought is a complex natural hazard that is best characterized by multiple climatological and hydrological parameters. Improving our understanding of the relationships between these parameters is necessary to reduce the impacts of drought. Data mining is a recently developed technique that can be used to interact with large databases and assist in the discovery of associations between drought a...

متن کامل

Discovering Neuronal Connectivity from Serial Patterns in Spike Train Data

Repeating patterns of precisely-timed activity across a group of neurons (called frequent episodes) are indicative of networks in the underlying neural tissue. This paper develops statistical methods to determine functional connectivity among neurons based on “non-overlapping” occurrences of episodes. We study the distribution of episode counts and develop a two-phase strategy for identifying f...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996